Engineering posts about Delta Lake
Curated summaries and key learnings for engineers working with Delta Lake.
PipelineIQ: Forward‑Looking Sales Intelligence That Drives Action
PipelineIQ is an AI-driven solution designed to enhance sales intelligence by transforming messy CRM data into actionable insights. Built on Databricks, it utilizes Foundation Model APIs and Delta...
Expanded interoperability with Unity Catalog Open APIs
The article covers Unity Catalog's Open APIs, which improve interoperability by letting enterprises maintain a single copy of data while...
The Convergence of Open Table Formats and Open Catalogs: Catalog Commits is Generally Available
The article announces the General Availability of Catalog Commits, a significant enhancement for Delta Lake and Unity Catalog that aims to unify the lakehouse architecture by addressing coordination...
Mercedes-Benz Builds a Cross-Cloud Data Mesh with Delta Sharing and Intelligent Replication, Cutting Costs by 66%
Mercedes-Benz built a cross-cloud data mesh using Databricks Delta Sharing and Delta Deep Clone for secure, cost-effective data exchange between AWS and Azure....
From Tribal Knowledge to Instant Answers: Building Reffy on Databricks
The article discusses the development of Reffy, an application built on Databricks to streamline the discovery of customer references. It addresses the challenges of accessing tribal knowledge within...
Nasdaq eVestment Data Now on Databricks Marketplace
Nasdaq eVestment data is now available through Delta Sharing on Databricks Marketplace, giving asset managers access to live, query-ready institutional investor data. This...
Announcing General Availability of Zerobus Ingest, part of Lakeflow Connect
Zerobus Ingest is now generally available as a fully managed, serverless service for streaming data directly into Delta tables, eliminating the need for...
Self-Optimizing Football Chatbot Guided by Domain Experts on Databricks
This article outlines the development of a self-optimizing football chatbot designed to assist coaches by analyzing play-by-play data and providing insights based on expert feedback. The architecture...
Delta Lake Explained: Boost Data Reliability in Cloud Storage
Delta Lake is an open-source storage layer that enhances data lakes by providing ACID transactions, schema enforcement, and time travel capabilities, transforming unreliable data lakes into...
2025 in Review: Databricks SQL, faster for every workload
In 2025, Databricks SQL achieved significant performance enhancements, delivering up to 40% faster execution across various workloads such as BI, ETL, and spatial analytics. These improvements are...
Arctic Wolf’s Liquid Clustering Architecture Tuned for Petabyte Scale
Arctic Wolf has implemented a liquid clustering architecture to optimize the processing of over one trillion security events daily, resulting in enhanced query performance and data freshness. By...
Completing the Lakehouse Vision: Open Storage, Open Access, Unified Governance
The article outlines the advancements in data governance within lakehouse architectures, specifically through the introduction of Unity Catalog, which unifies attribute-based access control across...